VISOR: Towards On-the-Fly Large-Scale Object Category Retrieval

نویسندگان

  • Ken Chatfield
  • Andrew Zisserman
چکیده

This paper addresses the problem of object category retrieval in large unannotated image datasets. Our aim is to enable both fast learning of an object category model, and fast retrieval over the dataset. With these elements we show that new visual concepts can be learnt on-the-fly, given a text description, and so images of that category can then be retrieved from the dataset in realtime. To this end we compare state of the art encoding methods and introduce a novel cascade retrieval architecture, with a focus on achieving the best trade-off between three important performance measures for a realtime system of this kind, namely: (i) class accuracy, (ii) memory footprint, and (iii) speed. We show that an on-the-fly system is possible and compare its performance (using noisy training images) to that of using carefully curated images. For this evaluation we use the VOC 2007 dataset together with 100k images from ImageNet to act as distractors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient On-the-fly Category Retrieval Using ConvNets and GPUs

We investigate the gains in precision and speed, that can be obtained by using Convolutional Networks (ConvNets) for on-the-fly retrieval – where classifiers are learnt at run time for a textual query from downloaded images, and used to rank large image or video datasets. We make three contributions: (i) we present an evaluation of state-ofthe-art image representations for object category retri...

متن کامل

Visor: Video Surveillance Online Repository

Aim of the Visor Project [1] is to gather and make freely available a repository of surveillance and video footages for the research community on pattern recognition and multimedia retrieval. The goal is to create an open forum and a free repository to exchange, compare and discuss results of many problems in video surveillance and retrieval. Together with the videos, the repository contains me...

متن کامل

Towards a Computer-Based Understanding of Medieval Images

Research in the field of cultural heritage requires computer vision algorithms that can automatically advance to the representational content of images. To make large scale image databases accessible it is crucial that computer-based object retrieval in images lives up to what it really is, a search through images rather than a search through textual annotations as in many current retrieval sys...

متن کامل

Performance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature

Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...

متن کامل

The Sentient Visor: Towards a Browser for the Internet of Things

As we advance towards an Internet of Things (IoT), it becomes necessary to conceive new forms of interaction between people and the everyday objects that surround them. A world full of internetworked objects with sensing capabilities requires effective tools in order to interpret contextual data gathered from the environment and services provided by smart objects. We aim to provide a browser fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012